A Novel Online Encyclopedia-Oriented Approach for Large-Scale Knowledge Base Construction
Authors
Abstract
When constructing a large-scale knowledge base, manual construction lacks both efficiency and flexibility. Automatically extracting massive amounts of knowledge from online encyclopedias has therefore attracted attention from an increasing number of scholars. Current research focuses mainly on extracting data from English online encyclopedias, whereas research on knowledge extraction from Chinese or other-language data sources is rare. For this reason, the present paper proposes an automatic construction scheme for a large-scale knowledge base based on a Chinese online encyclopedia. (i) In the first phase of the scheme, self-expanded learning is performed on the semantic relations between subjects and objects among the knowledge triples. (ii) In the second phase, semantic relations between the marked attributes and their entities are predicted using Conditional Random Fields (CRFs) and a support vector machine (SVM) classifier. A large-scale knowledge base is automatically constructed based on the scheme, and the experimental results indicate that the scheme is feasible and effective.
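The abstract does not give the details of the self-expanded learning phase, so the following is only a rough illustration of the general idea of growing relation labels from a few seeds over knowledge triples. It is not the paper's algorithm. The `Triple` type, the `self_expand` function, the expansion rule (an unlabeled attribute inherits the relation of an already labeled triple that shares the same subject and object), and all sample data are assumptions made for this sketch.

```python
from typing import Dict, List, NamedTuple


class Triple(NamedTuple):
    """A (subject, attribute, object) triple, e.g. from an encyclopedia infobox."""
    subject: str
    attribute: str  # raw attribute name as it appears on the page
    obj: str


def self_expand(triples: List[Triple],
                seed_patterns: Dict[str, str],
                max_rounds: int = 5) -> Dict[str, str]:
    """Grow an attribute -> relation map from a few seed patterns.

    Hypothetical rule: if an unlabeled attribute links the same
    (subject, object) pair as an already labeled attribute, it is
    assumed to express the same relation.
    """
    patterns = dict(seed_patterns)
    for _ in range(max_rounds):
        # (subject, object) pairs whose relation is known at the start of this round
        known = {
            (t.subject, t.obj): patterns[t.attribute]
            for t in triples
            if t.attribute in patterns
        }
        added = False
        for t in triples:
            if t.attribute not in patterns and (t.subject, t.obj) in known:
                patterns[t.attribute] = known[(t.subject, t.obj)]
                added = True
        if not added:  # nothing new was learned, so stop early
            break
    return patterns


# Invented sample data, for illustration only
sample = [
    Triple("Beijing", "country", "China"),
    Triple("Beijing", "nation", "China"),
    Triple("Shanghai", "nation", "China"),
    Triple("Shanghai", "state", "China"),
    Triple("Li Bai", "born", "701"),
]
learned = self_expand(sample, {"country": "locatedIn"})
```

In this toy run, "nation" is labeled in the first round through the shared Beijing/China pair, which in turn lets "state" be labeled in the second round; "born" stays unlabeled. Under this reading, the second phase described in the abstract would then hand the remaining unlabeled attributes to trained CRF and SVM models, which this sketch does not cover.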
Similar resources
A Proposal for a Gene Functions Wiki
Large knowledge bases integrating different domains can provide a foundation for new applications in biology such as data mining or automated reasoning. The traditional approach to the construction of such knowledge bases is manual and therefore extremely time consuming. The ubiquity of the internet now makes large-scale community collaboration for the construction of knowledge bases, such as t...
The effect of language complexity and group size on knowledge construction: Implications for online learning
This study investigated the effect of language complexity and group size on knowledge construction in two online debates. Knowledge construction was assessed using Gunawardena et al.’s Interaction Analysis Model (1997). Language complexity was determined by dividing the number of unique words by total words. It refers to the lexical variation. The results showed that...
Online Aggregation of Coherent Generators Based on Electrical Parameters of Synchronous Generators
This paper proposes a novel approach for coherent generators online clustering in a large power system following a wide area disturbance. An interconnected power system may become unstable due to severe contingency when it is operated close to the stability boundaries. Hence, the bulk power system controlled islanding is the last resort to prevent catastrophic cascading outages and wide area bl...
Towards Automatic Construction of Knowledge Bases from Chinese Online Resources
Automatically constructing knowledge bases from online resources has become a crucial task in many research areas. Most existing knowledge bases are built from English resources, while few efforts have been made for other languages. Building knowledge bases for Chinese is of great importance on its own right. However, simply adapting existing tools from English to Chinese yields inferior result...
A Convenient Base-Mediated Diastereoselective Synthesis of 2-Oxo-N,4,6-triarylcyclohex-3-enecarboxamides via Claisen-Schmidt Condensation
Sodium acetate catalyzed the multi-component reaction of acetophenone, aromatic aldehydes, and acetoacetanilide in the water-ethanol mixture (1:1) at ambient temperature via Claisen-Schmidt condensation results in the formation of highly substituted cyclohexenones in 89–98% yields. The developed efficient catalytic approach to the substituted cyclohexenones – the promising ...
Journal: JSW
Volume: 9
Publication date: 2014